Knowledge from Speech Production Used in Speech Technology: Articulatory Synthesis

Author

  • Richard S. McGowan
Abstract

There appears to be a continuing trend toward incorporating knowledge of speech production into speech technology: text-to-speech synthesis (e.g., Bickley, Stevens, & Williams, 1994; Parthasarthy & Coker, 1992), low bit rate coding (see Schroeter & Sondhi, 1992), and automatic speech recognition (e.g., Rose, Schroeter, & Sondhi, 1994; Shirai & Kobayashi, 1986). For automatic speech recognition, using knowledge of the coordination of the vocal tract articulators and the resulting acoustics can reduce apparent token-to-token variability, so that general pattern recognition algorithms have less work to do. Using articulatory representations in speech coding has the potential to greatly reduce bit rate, because the articulators move relatively slowly and may be described by a few parameters, either through an underlying dynamical model or through simple curve fitting. Finally, text-to-speech synthesis can be improved using articulator control parameters, because the acoustic output is constrained by the laws of physics, which can be used to produce the correct bundle of acoustic features with a comparatively limited parameterization. All these applications that depend on an articulatory representation of speech production can be grounded in what is called an articulatory synthesizer. An articulatory synthesizer is a device that produces speech output from a set of articulatory parameters (an articulatory representation). These devices are usually implemented in software on a digital computer.
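As a rough illustration of how a small articulatory parameterization can fix a whole bundle of acoustic features, the sketch below maps a single parameter, vocal tract length, to the first few resonances of a uniform tube closed at the glottis and open at the lips (the textbook quarter-wavelength resonator). It is not the synthesizer described in the paper: the function name `uniform_tube_formants`, the speed-of-sound constant, and the example lengths are assumptions chosen for illustration, and a real articulatory synthesizer would use a non-uniform area function and a source model.

```python
# Minimal sketch: one articulatory parameter (tract length) determines
# several acoustic features (formant frequencies) through physics alone.
# Assumption: the vocal tract is a lossless uniform tube, closed at the
# glottis and open at the lips, so resonances fall at odd multiples of c / (4 L).

SPEED_OF_SOUND_CM_PER_S = 35000.0  # assumed value for warm, humid air


def uniform_tube_formants(tract_length_cm, count=3):
    """Return the first `count` resonance frequencies (Hz) of a uniform tube."""
    if tract_length_cm <= 0:
        raise ValueError("tract length must be positive")
    return [
        (2 * n - 1) * SPEED_OF_SOUND_CM_PER_S / (4.0 * tract_length_cm)
        for n in range(1, count + 1)
    ]


if __name__ == "__main__":
    # Two hypothetical tract lengths: a longer and a shorter vocal tract.
    for length_cm in (17.5, 14.0):
        formants = uniform_tube_formants(length_cm)
        print(length_cm, [round(f) for f in formants])
```

Changing the single length parameter shifts all resonances together, which is the sense in which the acoustic output is constrained by physics rather than specified feature by feature.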


Similar resources

Interdisciplinary Approaches for Advancing Articulatory Speech Theory and Synthesis

Articulatory synthesis research has long been dominated by frequency-domain and concatenative sample-based speech synthesis techniques. While successful in some domains (e.g., voice-based databases), these techniques still cannot produce natural looking and sounding speech from text for an arbitrary speaker. Natural looking and sounding speech technology is one of the next major milestones in voic...


Perspectives for articulatory speech synthesis

Articulatory speech synthesis currently has two perspectives. (i) Technical perspective: Due to progress in common computer hardware (general increase in computation rate) and software (usability of compilers and simulation software) it is now possible to develop comprehensive phonetic models of speech production reaching nearly real-time for the calculation of acoustic speech signals. Furtherm...


Speech Communication and Speech Technology

Activities in the speech group, including CTT, cover a wide variety of topics, ranging from detailed theoretical development of speech production models through phonetic analyses to practical applications of speech technology. Several theses have been presented during the year spanning a range of research topics including articulatory modelling, multimodal dialogue systems and natural language ...


Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics in the multimedia domain is enabling computers to produce speech. Speech synthesis gives the computer the human ability of speech production. The data-based approach and the process-based approach are the two main approaches to speech synthesis. Each approach has its own challenges. Unit-selection speech synthesis and statistical parametr...


Recognizing Speech with Anthropomorphic Models For Voice Synthesis Application to Humanoid Robotics

In order to emulate in robots the speech production and learning capabilities of human infants, exploratory strategies in articulatory synthesizers have been proposed for the creation of acoustic-to-motor associations. However, commonly used articulatory speech synthesis models are based on an unconstrained modeling of the physiology of the human vocal tract, which contains many redundant paramet...




Publication date: 2009